Dear Examiner Holmes,



Attached please find your search results for the above-referenced case.

Please contact me if you have any questions or would like a re-focused search. 

File 275:Gale Group Computer DB(TM) 1983-2004/Sep 16
(c) 2004 The Gale Group
File 47:Gale Group Magazine DB(TM) 1959-2004/Sep 16
(c) 2004 The Gale Group
File 75:TGG Management Contents (R) 86-2004/Sep W1
(c) 2004 The Gale Group
File 636:Gale Group Newsletter DB(TM) 1987-2004/Sep 16
(c) 2004 The Gale Group
File 16:Gale Group PROMT (R) 1990-2004/Sep 16
(c) 2004 The Gale Group
File 624:McGraw-Hill Publications 1985-2004/Sep 15
(c) 2004 McGraw-Hill Co. Inc
File 484:Periodical Abs Plustext 1986-2004/Sep W1
(c) 2004 ProQuest
File 613:PR Newswire 1999-2004/Sep 16
(c) 2004 PR Newswire Association Inc
File 813:PR Newswire 1987-1999/Apr 30
(c) 1999 PR Newswire Association Inc
File 141:Readers Guide 1983-2004/Jul
(c) 2004 The HW Wilson Co
File 239:Mathsci 1940-2004/Nov
(c) 2004 American Mathematical Society
File 696:DIALOG Telecom. Newsletters 1995-2004/Sep 15
(c) 2004 The Dialog Corp.
File 553:Wilson Bus. Abs. FullText 1982-2004/Jul
(c) 2004 The HW Wilson Co
File 621:Gale Group New Prod.Annou. (R) 1985-2004/Sep 16
(c) 2004 The Gale Group
File 674:Computer News Fulltext 1989-2004/Aug W4
(c) 2004 IDG Communications
File 88:Gale Group Business A.R.T.S. 1976-2004/Sep 15
(c) 2004 The Gale Group
File 369:New Scientist 1994-2004/Sep W1
(c) 2004 Reed Business Information Ltd.
File 160:Gale Group PROMT (R) 1972-1989
(c) 1999 The Gale Group
File 635:Business Dateline(R) 1985-2004/Sep 16
(c) 2004 ProQuest Info&Learning
File 15:ABI/Inform(R) 1971-2004/Sep 16
(c) 2004 ProQuest Info&Learning
File 9:Business & Industry(R) Jul/1994-2004/Sep 15
(c) 2004 The Gale Group
File 13:BAMP 2004/Sep W1
(c) 2004 The Gale Group
File 810:Business Wire 1986-1999/Feb 28
(c) 1999 Business Wire
File 610:Business Wire 1999-2004/Sep 16
(c) 2004 Business Wire.
File 647:CMP Computer Fulltext 1988-2004/Sep W1
(c) 2004 CMP Media, LLC
File 98:General Sci Abs/Full-Text 1984-2004/Jul
(c) 2004 The HW Wilson Co.
File 148:Gale Group Trade & Industry DB 1976-2004/Sep
(c) 2004 The Gale Group
File 570:Gale Group MARS (R) 1984-2004/Sep 16
(c) 2004 The Gale Group



18/3, K/1 (Item 1 from file: 275)

DIALOG (R) File 275: Gale Group Computer DB(TM)
(c) 2004 The Gale Group. All rts. reserv. 

01868529 SUPPLIER NUMBER: 17610980 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Document imaging products help schools eliminate "paper jams." (includes 
vendor directory) (Buyers Guide) 

Carmona, Jeff 

THE Journal (Technological Horizons In Education), v23, n4, p8(3) 
Nov, 1995 

DOCUMENT TYPE: Buyers Guide ISSN: 0192-592X LANGUAGE: English 

RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 1394 LINE COUNT: 00119 

software, which features 3D "Tree" and "Grid" views that help one 
understand large numbers of documents at a glance. Display criteria may 
be changed on-the-fly to group files by author, date or content attribute. 

Its TextDataBase engine, based on linguistic research at the Xerox 
Palo Alto Research Center (PARC), analyzes documents as they are 
indexed into the system, matching words to their roots 



18/3, K/5 (Item 5 from file: 275)

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

01579865 SUPPLIER NUMBER: 13047883 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Pages, pictures, & PCs: words or images, monochrome or color: scanners and
OCR make any data PC-compatible. (optical character recognition) (Special
Section on Scanners, includes related articles on the scanner standard
TWAIN, the same image scanned by different devices, tracking data,
converting fax data to text, and buying scanners) (Buyers Guide)
Keizer, Gregg

Computer Shopper, v13, n1, p382(15)
Jan, 1993 

DOCUMENT TYPE: Buyers Guide ISSN: 0886-0556 LANGUAGE: ENGLISH 

RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 6330 LINE COUNT: 00488 

... 386 and 486 computers some of the sophisticated abilities once
reserved for minis and mainframes. PageKeeper's automatic document
input, keywording analysis, image compression, and powerful
search-and-retrieval engine -- able to use one document, hypertext-fashion,
as search criteria to find similar ones -- promise to make gathering,
organizing, and delivering documents on your PC easier than ever before.
When you get right down to it, scanning...



18/3, K/6 (Item 6 from file: 275)

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

01549488 SUPPLIER NUMBER: 13039891 (USE FORMAT 7 OR 9 FOR FULL TEXT)

Personalized information delivery: an analysis of information filtering 
methods. (Information Filtering) (Technical) (Cover Story) 

Foltz, Peter W.; Dumais, Susan T. 
Communications of the ACM, v35, n12, p51(10)
Dec, 1992 

DOCUMENT TYPE: Cover Story ISSN: 0001-0782 LANGUAGE: ENGLISH 

RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 7965 LINE COUNT: 00629 

the analysis performed by SVD geometrically. The result of the SVD
is a k-dimensional vector space containing a vector for each term and
each document. The location of term vectors reflects the
correlations in their usage across documents. Similarly, the location
of document vectors reflects correlations in term usage. In this
space the cosine or dot product between vectors corresponds to their
estimated similarity. Retrieval proceeds by using the terms in a query to
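
The cosine comparison mentioned in the excerpt above can be sketched as follows. This is only an illustration, not code from the cited article: it assumes the query and document vectors have already been projected into the k-dimensional space (the SVD step itself is not shown), and the names `cosine` and `rank_documents` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_u * norm_v)


def rank_documents(query_vec, doc_vecs):
    """Return (name, similarity) pairs, most similar document first.

    doc_vecs maps a document name to its vector in the reduced space
    (assumed to be precomputed elsewhere).
    """
    scored = [(name, cosine(query_vec, vec)) for name, vec in doc_vecs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

A document whose vector points in the same direction as the query scores 1.0, and an orthogonal one scores 0.0, so the sorted list serves as a relevance ranking.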



18/3, K/12 (Item 2 from file: 47) 

DIALOG (R) File 47: Gale Group Magazine DB(TM) 
(c) 2004 The Gale Group. All rts. reserv.

04632438 SUPPLIER NUMBER: 16452979 (USE FORMAT 7 OR 9 FOR FULL TEXT)

Visual Recall vs. PageKeeper 2.0: filing cabinets of the future. (XSoft; 
Caere Corp) (Software Review) (Evaluation) (Brief Article) 

O'Malley, Chris 
PC/Computing, v8, n3, p79(1)
March, 1995 

DOCUMENT TYPE: Evaluation Brief Article ISSN: 0899-1847 

LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 585 LINE COUNT: 00053 

s OmniPage program, and a powerful search engine that automatically
indexes the full text of documents and recognizes similarities by
spotting root words. There's also PageKeeper's hallmark, a prioritizing
feature called Weighted Relevance Retrieval: Word searches produce a
color-coded and startlingly accurate ranking of relevant documents.

However, there's nothing unearthly about PageKeeper's price. At less
than $100 on the...



18/3, K/15 (Item 2 from file: 636) 

DIALOG (R) File 636: Gale Group Newsletter DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

03923761 Supplier Number: 50163274 (USE FORMAT 7 FOR FULLTEXT) 
New Services Aim To Boost Efficiency Of Search Engines 

Electronic Advertising & Marketplace Report, v12, n13, pN/A
July 14, 1998

Language: English Record Type: Fulltext
Document Type: Magazine/Journal; Trade
Word Count: 861 

...take a more proactive role in helping marketers create more
effective search listings. WebPosition Gold features Page Generator,
which collects keywords and details about a business and generates HTML
doorway pages for selected search engines. An additional service is page
critic, which compares search pages to the top ranking pages on a
search engine. Since the universal demand from Web marketers is for
statistics, WebPosition...



18/3, K/27 (Item 2 from file: 553) 

DIALOG (R) File 553: Wilson Bus. Abs. FullText
(c) 2004 The HW Wilson Co. All rts. reserv.

04286454 H.W. WILSON RECORD NUMBER: BWBA00036454 (USE FORMAT 7 FOR FULLTEXT)

Verifying the proximity and size hypothesis for self-organizing maps.

AUGMENTED TITLE: Kohonen Self-organizing Map unsupervised learning technique

Lin, Chienting; Chen, Hsinchun; Nunamaker, Jay F

Journal of Management Information Systems (J Manage Inf Syst) v. 16 no3
(Winter 1999/2000) p. 57-70
LANGUAGE: English
WORD COUNT: 4401

(USE FORMAT 7 FOR FULLTEXT) 

TEXT: 

... of terms in decreasing order of frequency for the entire
collection. Based on the indexing terms identified, each document then
is represented by a term vector of 1 or 0. The number of 1s in each
document is equal to the number of terms in the document, and each
vector position corresponds with one unique term.

We chose a 20*10 grid map for displaying SOM outputs, based on what
would...

...of running the same input file against the trained map and reporting the
map grid location that is the closest in Euclidean distance to each
input. Each document (vector) and each term (represented as a unit
vector) were thus mapped to a node and also to a region (of the same nodes
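
The two steps in the excerpt above (a 1/0 term vector per document, then lookup of the closest map node by Euclidean distance) can be sketched as follows. This is a hedged illustration only: the function names, the dictionary layout of the trained map, and the sample vocabulary are assumptions, and SOM training itself is not shown.

```python
import math


def binary_term_vector(doc_terms, vocabulary):
    """1 where the vocabulary term occurs in the document, else 0.

    Each position corresponds to one unique indexing term.
    """
    present = set(doc_terms)
    return [1 if term in present else 0 for term in vocabulary]


def closest_node(vector, node_weights):
    """Grid position whose weight vector is nearest in Euclidean distance.

    node_weights maps a (row, col) grid position to the weight vector of
    an already trained map (a hypothetical layout for this sketch).
    """
    return min(node_weights, key=lambda pos: math.dist(vector, node_weights[pos]))
```

For instance, with the vocabulary `["map", "node", "term"]`, a document containing "term" and "map" becomes `[1, 0, 1]` and is assigned to whichever node's weights lie closest to that vector.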

26/5/9 (Item 9 from file: 350)

DIALOG (R) File 350:Derwent WPIX

(c) 2004 Thomson Derwent. All rts. reserv.



015738544 **Image available** 
WPI Acc No: 2003-800745/200375 

Related WPI Acc No: 2000-292891; 2000-338800; 2004-419529 

XRPX Acc No: N03-641642 

Profile terms managing method for knowledge management, involves 
selecting profile terms and allocating terms to private portion 
having confidence value based on contextual characteristic of profile 
terms 

Patent Assignee: TACIT KNOWLEDGE SYSTEMS INC (TACI-N) 

Inventor: EPELMAN-WANG H; GILMOUR D L; GOLDBERG J M; SANDERS R; WANG E 
Number of Countries: 001 Number of Patents: 001 
Patent Family:
Patent No    Kind Date     Applicat No     Kind Date     Week
US 6640229   B1   20031028 US 98156468     A    19980918 200375 B
                           US 99271022     A    19990317
                           US 2000697700   A    20001025

Priority Applications (No Type Date): US 2000697700 A 20001025; US 98156468
A 19980918; US 99271022 A 19990317
Patent Details:
Patent No    Kind Lan Pg  Main IPC     Filing Notes
US 6640229   B1       57  G06F-017/30  CIP of application US 98156468
                                       CIP of application US 99271022

Abstract (Basic): US 6640229 B1

NOVELTY - The method involves selecting profile terms allocated 
to a private portion of a user profile according to a pre-determined 
criterion. The selected profile terms are moved to a public portion 
of the user profile based on another criterion. The profile terms 
allocated to the private portion have a confidence value based on a 
contextual characteristic of the profile terms within an electronic 
document.

DETAILED DESCRIPTION - The pre-determined criteria are threshold 
levels for the confidence value. A treatment option is associated for a 
profile term to determine when the profile term is moved to the 
public portion. 

An INDEPENDENT CLAIM is also included for a computer readable 
medium having stored computer-executable instructions for performing a 
method to automatically allocate profile terms between a public and 
private profile of a user. 

USE - Used for managing a user knowledge profile within a database. 

ADVANTAGE - The method increases the public knowledge resources of 
the organization without over-burdening the user. The user can 
utilize the agent to automatically publish profile terms rather than 
having to continually decide which terms should be published and the 
user maintains control over the criteria used by the agent in 
publishing the profile terms.

DESCRIPTION OF DRAWING (S) - The drawing shows a flowchart of a
method of performing a profile modification process.
pp; 57 DwgNo 17E/27
Title Terms: PROFILE; TERM; MANAGE; METHOD; MANAGEMENT; SELECT; PROFILE;
TERM; ALLOCATE; TERM; PRIVATE; PORTION; CONFIDE; VALUE; BASED;
CHARACTERISTIC; PROFILE; TERM
Derwent Class: T01 

International Patent Class (Main) : G06F-017/30 

File Segment: EPI 
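
The threshold-based move of profile terms from the private to the public portion, as summarized in the abstract above, might be sketched like this. It is a simplified illustration under stated assumptions, not the patented method: a single confidence threshold stands in for the abstract's criteria, and the function name and dictionary layout are hypothetical.

```python
def publish_confident_terms(private, public, threshold):
    """Move terms whose confidence meets the threshold to the public portion.

    private and public map a profile term to its confidence value.
    Both dictionaries are modified in place and returned.
    """
    for term, confidence in list(private.items()):
        if confidence >= threshold:
            public[term] = confidence
            del private[term]
    return private, public
```

Terms below the threshold stay private, so the user's chosen threshold controls what an agent may publish on the user's behalf.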



26/5/13 (Item 13 from file: 350) 

DIALOG (R) File 350:Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv. 

015378257 **Image available** 

WPI Acc No: 2003-439195/200341 

XRPX Acc No: N03-350407 

Patient record searching method for health care information system,
involves initiating search of patient record, based on search criteria
including information for identifying particular document and patient
visit

Patent Assignee: BOWEN S W (BOWE-I) 
Inventor: BOWEN S W 

Number of Countries: 001 Number of Patents: 001
Patent Family:
Patent No       Kind Date     Applicat No    Kind Date     Week
US 20030036927  A1   20030220 US 2001313662  P    20010820 200341 B
                              US 2002219547  A    20020815

Priority Applications (No Type Date): US 2001313662 P 20010820; US
2002219547 A 20020815
Patent Details:
Patent No       Kind Lan Pg  Main IPC     Filing Notes
US 20030036927  A1       21  G06F-007/00  Provisional application US 2001313662
Abstract (Basic): US 20030036927 A1

NOVELTY - A search criteria including information for identifying
a particular document, a particular patient visit and a particular
organization, is received. Another criteria identifying a category
of the search criteria is received. A search of the patient record
is initiated, based on both criteria.

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the
following:

(1) user interface provision method; and

(2) record explorer.

USE - For searching patient record in clinical and health care
information systems used by health care provider such as hospital,
nursing home, assisting living care arrangement, home health care
arrangement, hospice arrangement, critical care arrangement,
health care clinic, physical therapy clinic, chiropractic clinic and
dental office for monitoring health and welfare of patient.

ADVANTAGE - Allows users to define their desired search criteria
easily. Provides the ability to search for data related to clinically
relevant keywords and concepts.

DESCRIPTION OF DRAWING (S) - The figure shows the health care 
information system. 

pp; 21 DwgNo 1/10 

Title Terms: PATIENT; RECORD; SEARCH; METHOD; HEALTH; CARE; INFORMATION;
SYSTEM; INITIATE; SEARCH; PATIENT; RECORD; BASED; SEARCH; CRITERIA;
INFORMATION; IDENTIFY; DOCUMENT; PATIENT; VISIT
Derwent Class: S05; T01

International Patent Class (Main): G06F-007/00

International Patent Class (Additional): G06F-017/00; G06F-017/30;
G06F-017/60

File Segment: EPI 



26/5/15 (Item 15 from file: 350) 

DIALOG (R) File 350:Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv.

015194132 **Image available** 

WPI Acc No: 2003-254666/200325 

XRPX Acc No: N03-201886 

Method for summing up repeat of string in data document which is
greatly useful for super high-capacity of data document, especially for
data document that has strings with higher frequency of display

Patent Assignee: INVENTEC BESTA CO LTD (INVE-N) 

Inventor: CHEN H 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

TW 490622 A 20020611 TW 2000126955 A 20001215 200325 B 

Priority Applications (No Type Date): TW 2000126955 A 20001215 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
TW 490622 A G06F-017/30 

Abstract (Basic) : TW 490622 A 

NOVELTY - Provided is a method for summing up repeat of string in
data document. This method uses the biased address in the data
document of the first characters between two identical characters in
sequential arrangement as the index value according to intervals
among each identical character when an electric product proceeds
compressing for a data document stored inside it, and uses
corresponding character interval to proceed arrangement to create an
index table. When summing up for string that has high frequency of
display in the data document, it is able to replace real string in
the data document with the index value and use the index table as
the index for comparison. It records frequency that strings
display repeatedly according to the number of matched words and
sums up repeat frequency of strings of number of different matched
words to fast complete summing up frequency of strings that display
repeatedly.

DETAILED DESCRIPTION - It also makes the electric product obtain
feature of the data document according to the results after summing
up to use the best compressing solution to compress the data document
for increasing data compression efficiency. This method is greatly
useful for super high-capacity of data document, especially for data
document that has strings with higher frequency of display.
DwgNo 1/1 

Title Terms: METHOD; SUM; UP; REPEAT; STRING; DATA; DOCUMENT; USEFUL;
SUPER; HIGH; CAPACITY; DATA; DOCUMENT; DATA; DOCUMENT; STRING; HIGH;
FREQUENCY; DISPLAY
Derwent Class: T01 

International Patent Class (Main) : G06F-017/30 
File Segment: EPI 



26/5/17 (Item 17 from file: 350) 

DIALOG (R) File 350:Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv. 

014835504 **Image available** 

WPI Acc No: 2002-656210/200270 

XRPX Acc No: N02-518669 

Entity location identification method in multilingual environment, 
involves constructing translation table including at least a pair of 
natural language term for descriptor 

Patent Assignee: JAQUA E (JAQU-I) 

Inventor: JAQUA E 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No       Kind Date     Applicat No    Kind Date     Week
US 20020099533  A1   20020725 US 2001768989  A    20010123 200270 B

Priority Applications (No Type Date): US 2001768989 A 20010123
Patent Details:
Patent No       Kind Lan Pg  Main IPC     Filing Notes
US 20020099533  A1       22  G06F-017/20

Abstract (Basic): US 20020099533 A1

NOVELTY - An entity identification and at least one
characteristic such as product and service of the entity are stored in
specified language in an entity record. A translation table (98)
including at least a pair of natural language terms for a descriptor
is constructed. A specified natural language term is used as a search
parameter. The record for the entity characteristics corresponding
to the search term is searched.

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for the
following:

(1) Method of communication in multilingual environment; and 

(2) Data processing system. 

USE - For identifying location of businesses or other entities 
engaged in activities of interest in multilingual environment. 

ADVANTAGE - Facilitates accurate location of entities and 
communication between entities that do not share a common language, 
thereby avoiding a barrier created by language. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the data processing system. 

Translation table (98) 

pp; 22 DwgNo 2/10 

Title Terms: ENTITY; LOCATE; IDENTIFY; METHOD; ENVIRONMENT; CONSTRUCTION; 

Document data processor e.g. wordprocessor, has edit unit which reads
out object and index data from respective memory and displays it based
on object display demand sent from user
Patent Assignee: NEC CORP (NIDE)
Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 2001273314 A 20011005 JP 200087961 A 20000328 200175 B 

Priority Applications (No Type Date): JP 200087961 A 20000328 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 2001273314 A 13 G06F-017/30 

Abstract (Basic) : JP 2001273314 A 

NOVELTY - A storage unit (22) stores positional information of an 
object in a document based on analysis information about the 
object. A management unit (31) manages the index information 
synthesized with analysis and stored positional information of the 
object. An edit unit (42) reads out the object and index data from a 
pair of memories (32,33) and displays it, based on the object display 
demand sent from a user.

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the
following:

(a) Document data processing method;

(b) Recording medium storing document data processing program

USE - Document data processor e.g. wordprocessor.

ADVANTAGE - Utilization of the object in a diagram or the table is
made simple, based on the index of the object.

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of
the document data processor. (Drawing includes non-English language
text).

Storage unit (22)

Management unit (31)

Memories (32, 33)

Edit unit (42) 

Document classification for information retrieval system, involves
comparing created term and document vectors and storing document
at location relative to category node with term vector with preset
relevance ranking

Patent Assignee: SUN MICROSYSTEMS INC (SUNM)

Inventor: MOCKER J D; SNOW W A 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No    Kind Date     Applicat No   Kind Date     Week
US 6185550   B1   20010206 US 97874783   A    19970613 200152 B

Priority Applications (No Type Date): US 97874783 A 19970613
Patent Details:
Patent No    Kind Lan Pg  Main IPC     Filing Notes
US 6185550   B1       19  G06F-017/30

Abstract (Basic): US 6185550 B1

NOVELTY - Term vectors containing weights assigned to each of one
or more common terms in the corresponding terms file are created
and are compared with created document vectors of a document to
provide relevance ranking between the terms file and document.
The document is stored at a location corresponding to category node
having a term vector which has a relevance ranking that matches a
selected criteria.

DETAILED DESCRIPTION - A class hierarchy is created by providing 
several category nodes, each of which create term files. Class
hierarchy having a root category node within a free data structure is 
initialized and displayed. User selected commands for manipulating the 
class hierarchy are entered. A category command is processed in 
response to the user selected command having predefined state which 
causes the class hierarchy to contain several category nodes. Category 
nodes include category name, node type, node ID, parent ID, link ID 
which are all stored in the database. When the node type is predefined 
type a new category node is allowed to be added to the selected 
category nodes, otherwise new category node is prevented from being 
added to the category nodes. The node ID defines the unique directory. 
The parent ID is indicating the node ID of a parent category node. The 
link ID is indicating the node ID of several category nodes when the 
node type is of a predetermined type. INDEPENDENT CLAIMS are also 
included for the following:

(a) Document classifying; 

(b) Document classification program 

USE - For classification of documents within defined categories 
using class hierarchy in information retrieval system. 

ADVANTAGE - Since the automatic document classification within
user defined categories is provided, the user can interactively
search for documents according to search terms defined within user
defined categories. Since documents are ranked according to
relevance and a user specified number of documents which are most
relevant are returned, multiple users can access the document via
network.

DESCRIPTION OF DRAWING (S) - The figure shows the flowchart of main
procedure utilized in creation of the document directory hierarchy.
pp; 19 DwgNo 0/9

Title Terms: DOCUMENT; CLASSIFY; INFORMATION; RETRIEVAL; SYSTEM; COMPARE;
TERM; DOCUMENT; VECTOR; STORAGE; DOCUMENT; LOCATE; RELATIVE;
CATEGORY; NODE; TERM; VECTOR; PRESET; RELEVANT; RANK

Derwent Class: T01 

International Patent Class (Main) : G06F-017/30 

File Segment: EPI 
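
The comparison of category term vectors against a document vector, as summarized in the abstract above, can be illustrated roughly as follows. This is a sketch under stated assumptions rather than the patented procedure: the relevance score is a plain weighted term count, the "selected criteria" is modeled as a minimum score, and all names (`document_vector`, `relevance`, `classify`, `min_score`) are hypothetical.

```python
from collections import Counter


def document_vector(text):
    """Term-frequency vector for a document (whitespace tokenization)."""
    return Counter(text.lower().split())


def relevance(term_vector, doc_vector):
    """Weighted overlap between a category's term weights and a document."""
    return sum(weight * doc_vector.get(term, 0) for term, weight in term_vector.items())


def classify(text, categories, min_score=1.0):
    """Name of the best-scoring category node, or None if none qualifies.

    categories maps a category name to its term vector (term -> weight).
    """
    doc_vec = document_vector(text)
    scores = {name: relevance(tv, doc_vec) for name, tv in categories.items()}
    if not scores:
        return None
    best = max(scores, key=scores.get)
    return best if scores[best] >= min_score else None
```

A document would then be stored under the directory of the returned category, and left unclassified when no category reaches the minimum score.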



26/5/22 (Item 22 from file: 350) 

DIALOG (R) File 350:Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv.

013843457 **Image available** 
WPI Acc No: 2001-327670/200134 

Related WPI Acc No: 2001-273156; 2001-342950; 2001-389313 

XRPX Acc No: N01-235732 

Category data adding method for manipulating files representing
Internet web pages, involves mapping several keywords of link data
item to form category data which is added to the document

Designated States (National): JP US

Abstract (Basic): WO 200068833 A2

NOVELTY - A document is searched for a link data item. Several
keywords are identified within the link data item. The identified
link keywords are mapped into a category data that is added to the
document.

DETAILED DESCRIPTION - The data file includes a link data item
specifying a linked location within specific document. The link
data item is a hypertext link that includes a universal resource
identifier. INDEPENDENT CLAIMS are also included for the following:

(a) apparatus for adding category data to data file;

(b) recording medium

USE - For manipulating data files representing Internet web
pages.

ADVANTAGE - Reduces processing and bandwidth requirements, thus
making the system better suited to real time dynamic operation. The
document is characterized by analyzing link data rather than
analyzing the document itself. Each document is reduced to
suitable size without eliminating significant information to be
displayed in low resolution display devices such as mobile telephone,
personal digital assistant.

DESCRIPTION OF DRAWING (S) - The figure shows the system for adding
graphical data to the document.

Attribute extractor for structurized documents, extracts and outputs
attribute value corresponding to indexed position, obtained by
comparing input document content with prestored attribute schema

Patent Assignee: FUJI XEROX CO LTD (XERF)

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 
Priority Applications (No Type Date): JP 99246880 A 19990901

Abstract (Basic): JP 2001075974 A

NOVELTY - Contents of input document (1a) are compared with
prestored attribute schema (1f). Attribute name and its index
position corresponding to document content are extracted from
attribute schema, respectively by extractors (1b,1c). Attribute names
for position not indexed are deleted. Attribute data corresponding
to indexed positions is extracted by extractor (1d) and outputs the
data as a list (1e).

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the
following:

(a) Attribute extracting method;

(b) Recording medium with attribute extracting program

USE - For detecting convergence of attributes in structurized
documents specified in standard generalized markup language (SGML),
extensible markup language (XML).

ADVANTAGE - Required attribute is extracted simply without breaking
the format of document and without being conscious of variations in
document.

DESCRIPTION OF DRAWING (S) - The figure shows the conceptional
diagram of attribute extractor (The drawing includes non-English
language text).

Input document (1a)

Extractors (1b-1d)

List (1e)

Attribute schema (1f)

Visual document indexing method for computer based image analysis,
involves providing visual content signature for document by determining
spatial distribution of visual keywords

Number of Countries: 003 Number of Patents: 004
Patent Family:
Patent No      Kind Date     Applicat No   Kind Date     Week
WO 200043910   A1   20000727 WO 99SG1      A    19990122 200067
GB 2362078     A    20011107 WO 99SG1      A    19990122 200169
                             GB 200117734  A    20010720
GB 2362078     B    20030122 WO 99SG1      A    19990122 200308
                             GB 200117734  A    20010728
US 6574378     B1   20030603 WO 99SG1      A    19990122 200339
                             US 99341348   A    19990708

Priority Applications (No Type Date) : WO 99SG1 A 19990122 
Abstract (Basic): WO 200043910 A1

NOVELTY - Visual keywords (108) derived from visual tokens (104) 
which are extracted from visual documents (100) are compared with 
several visual tokens of other visual documents (120). Spatial 
distribution of visual keywords is determined based on comparison 
result which is represented by three dimensional map of detected 
locations of the visual keywords to provide visual content signature 
for the document (100).

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the
following:

(a) visual document indexing program;

(b) visual document indexing apparatus

USE - For classifying and searching image data dependent upon
content of image data for computer based image analysis.

ADVANTAGE - Since a statistically based coding method is used to 
transform signatures into real valued vectors of lower dimensions, the 
noise in visual content signatures is reduced. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram 
illustrating visual keywords generating system. 

Visual documents (100,120) 

Visual tokens (104) 

Visual keywords (108) 
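
Searcher's note: the signature described in the abstract above can be
pictured with the minimal Python sketch below. It is an illustration
only and is not taken from the patent. It assumes that each visual token
is a small numeric feature vector at a grid location, that each visual
keyword is a prototype vector, and that similarity is 1/(1 + Euclidean
distance). All function names and sample data are hypothetical.

```python
import math


def similarity(token, keyword):
    """Assumed measure: 1.0 for identical vectors, falling toward 0."""
    dist = math.sqrt(sum((t - k) ** 2 for t, k in zip(token, keyword)))
    return 1.0 / (1.0 + dist)


def content_signature(token_grid, keywords):
    """Build a rows x cols x len(keywords) map of keyword detections."""
    return [[[similarity(token, kw) for kw in keywords] for token in row]
            for row in token_grid]


def strongest_keyword(signature):
    """Index of the best-matching keyword at each grid location."""
    return [[max(range(len(cell)), key=cell.__getitem__) for cell in row]
            for row in signature]


# Hypothetical 2 x 2 grid of 2-D token features and two keyword prototypes.
grid = [[(0.0, 0.0), (1.0, 1.0)],
        [(0.9, 1.1), (0.1, 0.0)]]
prototypes = [(0.0, 0.0), (1.0, 1.0)]
signature = content_signature(grid, prototypes)
print(strongest_keyword(signature))
```

The third axis of the nested list holds one detection value per visual
keyword, which is one way to read the "three dimensional map" wording.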
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Searching method for documents similar to one held by user in
database records - involves establishing frequency count of specific
character rows in user held document, and matching them against
corresponding character row counts in test documents

Patent Assignee: HITACHI LTD (HITA)

Number of Countries: 001 Number of Patents: 001 
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Abstract (Basic): JP 11143902 A

NOVELTY - Database held in a magnetic disk (105) has text of
individual documents (103). The frequency of appearance file (104)
records frequency counts of specific character rows of each document.
From the user held document, characteristic character rows are
identified along with the frequency counts and a comparison of such
counts is effected to screen a document from the database.

USE - For searching document similar to one held by user in
database records.

ADVANTAGE - Affords high precision high speed search of document
in database, without need to use a word dictionary.

DESCRIPTION OF DRAWING(S) - The sketch gives the details of main
internal memory organization holding the control programs involved in
the database search along with the overall layout of the associated
modules.

(103) Text; (104) Frequency of appearance file for character rows;
(105) Magnetic disc.
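
Searcher's note: a minimal Python sketch of dictionary-free matching on
character row counts follows. It is an illustration, not the patented
method. It assumes a "character row" is a run of n consecutive
characters and scores a database document by the share of the query's
rows it also contains. The scoring formula, function names, and sample
texts are assumptions.

```python
from collections import Counter


def char_row_counts(text, n=2):
    """Count every run of n consecutive characters in the text."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def overlap_score(query_counts, doc_counts):
    """Shared row counts divided by the query's total row count."""
    total = sum(query_counts.values())
    if total == 0:
        return 0.0
    shared = sum(min(count, doc_counts[row])
                 for row, count in query_counts.items())
    return shared / total


def rank_similar(query_text, frequency_file, n=2):
    """Order document ids by descending overlap with the query text."""
    query_counts = char_row_counts(query_text, n)
    scored = [(overlap_score(query_counts, counts), doc_id)
              for doc_id, counts in frequency_file.items()]
    return [doc_id for _, doc_id in
            sorted(scored, key=lambda pair: (-pair[0], pair[1]))]


# Hypothetical database; counts are precomputed once, like file (104).
database = {"d1": "database search method", "d2": "cable storage box"}
frequency_file = {doc_id: char_row_counts(text)
                  for doc_id, text in database.items()}
print(rank_similar("searching a database", frequency_file))
```

Because the counts are taken over raw character runs, no word dictionary
or tokenizer is needed, which matches the advantage stated above.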

Dwg. 1/25 

Title Terms: SEARCH; METHOD; DOCUMENT; SIMILAR; ONE; HELD; USER;
DATABASE; RECORD; ESTABLISH; FREQUENCY; COUNT; SPECIFIC; CHARACTER; ROW;
USER; HELD; DOCUMENT; MATCH; CORRESPOND; CHARACTER; ROW; COUNT;
TEST; DOCUMENT

Derwent Class: T01

International Patent Class (Main): G06F-017/30

File Segment: EPI 



26/5/35 (Item 35 from file: 350) 

DIALOG(R) File 350:Derwent WPIX

(c) 2004 Thomson Derwent. All rts. reserv. 



012289814 **Image available** 

WPI Acc No: 1999-095920/199908 

XRPX Acc No: N99-069692 

Filing method for automatically organising computer information - 
involves associating profiles with folders and extracting data from new 
files to identify which folders to store it in 
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AU 9877214 A G06F-007/00 Based on patent WO 9900724 

US 5899995 A G06F-017/30 

Abstract (Basic): WO 9900724 A

The filing system has folders holding information in a hierarchical
organisation. Each folder has an associated set of profiles and
constraint expressions. When a file is to be stored, it is scanned
and its features extracted. These are compared with the profiles of
each folder. If the features match the folder profiles the features
are evaluated against the constraints. If the constraints are
satisfied the file is stored in that folder.

The features extracted can include simple file attributes and
file contents, both in terms of types of information, e.g. URLs, or
actual terms. The profiles also track user changes to the structure
to learn how information is to be filed.

ADVANTAGE - Stores files automatically based on their contents 
rather than on an application basis. 
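
Searcher's note: the two-stage test in the abstract above (profile
match, then constraint evaluation) can be sketched in Python as follows.
This is an illustration only. The feature naming scheme ("ext:",
"term:", "has:url"), the Folder class, and the subset test used for
profile matching are all assumptions, and the learning of profiles from
user changes is not modelled.

```python
from dataclasses import dataclass, field


@dataclass
class Folder:
    name: str
    profile: set                                      # features expected
    constraints: list = field(default_factory=list)   # predicates


def extract_features(filename, text):
    """Collect a file attribute, URL presence, and the terms used."""
    features = set()
    if "." in filename:
        features.add("ext:" + filename.rsplit(".", 1)[-1].lower())
    for word in text.lower().split():
        if word.startswith(("http://", "https://")):
            features.add("has:url")
        else:
            features.add("term:" + word.strip(".,;:"))
    return features


def choose_folders(features, folders):
    """Keep folders whose profile matches and whose constraints hold."""
    chosen = []
    for folder in folders:
        if not folder.profile <= features:
            continue  # profile does not match, skip the constraints
        if all(check(features) for check in folder.constraints):
            chosen.append(folder.name)
    return chosen


folders = [
    Folder("Invoices", {"term:invoice"}, [lambda f: "ext:pdf" in f]),
    Folder("Links", {"has:url"}),
]
found = extract_features("march.pdf",
                         "Invoice for March, see https://example.com")
print(choose_folders(found, folders))
```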
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Database index maintenance method - involves collating index 
entries according to file tiers which are periodically merged 
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Abstract (Basic): US 5765168 A

The indexing method involves indexing batches of records by
storing index entries in a memory, each index entry including a
word entry for each unique portion of information of the database, and
one or more location entries pointing at occurrences of the portions
of information. The index entries are collated according to the
order of the word entries, and sequentially according to the
locations of each word entry.

The index entries are organized in a number of tiers of files,
there at least initially being one tier of files for each batch of
records indexed. A subsequently produced tier of files is
periodically merged with a previously produced tier of files to
produce a merged tier of files, the index entries being a logical
union of the index entries of the subsequently and previously produced
tiers of files.

ADVANTAGE - Supports maintenance of database index as records 
are added and deleted. 
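
Searcher's note: a minimal Python sketch of the tiered scheme follows,
offered as an illustration rather than the patented implementation. It
assumes each tier is an in-memory list of (word, locations) entries
sorted by word, with each location a (record id, word position) pair.
The patent stores tiers as files, and handling of deleted records is
omitted here. All names are hypothetical.

```python
def index_batch(records):
    """Build one tier for a batch given as {record_id: text}."""
    tier = {}
    for record_id in sorted(records):
        for position, word in enumerate(records[record_id].lower().split()):
            tier.setdefault(word, []).append((record_id, position))
    # Collate by word; locations are already in record/position order.
    return sorted(tier.items())


def merge_tiers(older, newer):
    """Return the logical union of two tiers, collated by word."""
    merged = {word: list(locations) for word, locations in older}
    for word, locations in newer:
        merged.setdefault(word, []).extend(locations)
    return sorted((word, sorted(locs)) for word, locs in merged.items())


tier_one = index_batch({1: "red apple", 2: "green apple"})
tier_two = index_batch({3: "red pear"})
print(merge_tiers(tier_one, tier_two))
```

Keeping new batches in their own small tier makes each indexing run
cheap, while the periodic merge keeps the number of tiers that a lookup
has to consult from growing without bound.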
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Observing and comparing visual and textual record of repetitive or
related events, e.g. behavioural activities - has software for searching
and retrieving audio-visual and text data regarding selected event, for
searching glossary of keywords, for searching relationship chart and
searching sequentially through audio-visual records
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Patent Details: 
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US 5682330 A 12 G06F-015/82 CIP of application US 93158010 
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Abstract (Basic): US 5682330 A

Repetitive or related events are observed and compared and
recorded on a viewable media. The visual record is converted into a
digital format, which is stored in a database at an identifiable
location. A textual database comprising written descriptions of the
visual record and a glossary of key words identifying repetitive 
events are established and stored, the stored information constituting 
an information library. The library is stored in a microprocessor 
system, the microprocessor programmed so that a user can search and 
retrieve multiple images of a selected event, precursor visual images 
of events prior to the event in time, responsive activity showing 
events subsequent to the selected event and contemporaneous events, all 
retrieved visual images including textual information. 

The user can automatically update, reorder, supplement, and expand 
the information library as desired by accessing the original source of 
the information through use of an electronic network, comparing the 
information stored on the system with information available at the 
original source and selectively adding new information at the original 
source to the record stored in the microprocessor system. 

ADVANTAGE - Allows behaviour or cognitive processes to be
understood and quantified, and performance and responses to be
predicted.

Dwg. 1/3 

Title Terms: OBSERVE; COMPARE; VISUAL; TEXT; RECORD; REPEAT; RELATED;
EVENT; BEHAVE; ACTIVE; SOFTWARE; SEARCH; RETRIEVAL; AUDIO; VISUAL; TEXT;
DATA; SELECT; EVENT; SEARCH; KEYWORD; SEARCH; RELATED; CHART; SEARCH;
SEQUENCE; THROUGH; AUDIO; VISUAL; RECORD

Derwent Class: T01

International Patent Class (Main): G06F-015/82
International Patent Class (Additional): G06F-017/30

File Segment: EPI 



26/5/44 (Item 44 from file: 350) 

DIALOG(R) File 350:Derwent WPIX

(c) 2004 Thomson Derwent. All rts. reserv.



009483668 **Image available** 

WPI Acc No: 1993-177203/199322 

XRPX Acc No: N93-135811 

Electronic processing for excerpting and summarising uncoded document
image - using automatic or iterative morphological image recognition
technique to produce summaries based on identified significant portions
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Abstract (Basic): EP 544432 A

The electronic processing method involves sequencing the document
into image units without decoding the document image. Significant
image units are identified in accordance with selected morphological
image characteristics. An abbreviated document image is created
based on the identified significant image units.

The image units are classified according to frequency of occurrence
and according to location within the document image. The selected
morphological image characteristics include characteristics
defining image units having predetermined linguistic criteria and
include an image unit shape dimension, font, typeface, pixel density
and cross-sectional characteristic.

ADVANTAGE - Provides document summary without decoding contents. 
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Abstract (Basic): US 6003027 A

NOVELTY - A CPU (120) executes categorization of several
documents (140) having specific size and feature into several
categories each corresponding to a score in result table. The
confidence level for each category is determined by taking product of
two credibilities and normalizing the product with another
credibility.

DETAILED DESCRIPTION - The credibilities are weighted by raising
each of the credibilities to a power that is a function of the size of
the document. The function is expressed by log (1+N/K), where N is the
size of the document measured by the number of features of the document
and K is a constant of value greater than one and less than N. The
confidence level is determined at an optional empirical value of K. The
documents are tagged by the category with highest confidence level to
which document is then routed.
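
Searcher's note: the abstract does not give the full formula, so the
Python sketch below is only one possible reading and should not be taken
as the patented computation. It assumes three credibilities c1, c2, c3
per category, a natural logarithm, and that "normalizing" means dividing
the weighted product of c1 and c2 by the weighted c3. Only the weight
log(1 + N/K) comes from the abstract. All names are hypothetical.

```python
import math


def size_weight(n_features, k):
    """Exponent from the abstract: log(1 + N/K), natural log assumed."""
    return math.log(1.0 + n_features / k)


def confidence(c1, c2, c3, n_features, k):
    """Assumed reading: (c1^w * c2^w) / c3^w with w = log(1 + N/K)."""
    w = size_weight(n_features, k)
    return (c1 ** w) * (c2 ** w) / (c3 ** w)


def route(category_credibilities, n_features, k):
    """Pick the category whose confidence level is highest."""
    return max(
        category_credibilities,
        key=lambda cat: confidence(*category_credibilities[cat],
                                   n_features, k),
    )


# Hypothetical credibilities (c1, c2, c3) for a 50-feature document.
scores = {"sports": (0.9, 0.8, 0.9), "news": (0.3, 0.2, 0.9)}
print(route(scores, n_features=50, k=10))
```

Under this reading a longer document (larger N) gets a larger exponent,
so differences between credibilities are amplified as more features
become available.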

An INDEPENDENT CLAIM is also included for categorization level 
verifying method. 

USE - For use during categorizing of objects such as books,
articles, reports, pictures, movies or recordings stored in memory of
computer in network environment e.g. LAN, WAN, internet, etc. Also for
categorizing documents such as electronic mail, scientific
articles, news stories, business information, sports information,
requests to service organization, claim, letter, transcription of
telephone calls, etc.

ADVANTAGE - Categorization results can be accepted without need
for human intervention by setting confidence value suitably
corresponding to selected error rate. As confidence level of document
categorizing is improved efficiently, documents can be routed or
tagged with the category to proper destination.

DESCRIPTION OF DRAWING(S) - The figure shows the block diagram of
confidence level determining system.

CPU (120)
Document (140)

pp; 16 DwgNo 1/8 
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Method for resolution of natural-language queries against full-text
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INTERNATIONAL PATENT CLASS: G06F-017/27 ... 
. . . G06F-017/30 

...SPECIFICATION present invention are completely integrated, more advanced 
mechanisms for matching the syntax of the query against the syntax of 
the matching terms in the document can be employed. 

6. Term Specificity and Information Content Certain terms, such as 
"stuff", "things", and "information", are especially vague... 

...CLAIMS senses are semantically linked and to develop link strengths 
indicating the semantic closeness of the term senses. 

9. The method of claim 1 wherein step (c) comprises a ranking according
to at least one of the following criteria: inverse document
frequency; syntactic position; part of speech; application of a
predetermined concept tree; part of speech; predetermined ranking;
and explicit selection by a human user.

10. The method of Claim 1 wherein step (d) of identifying documents
further comprises the steps of:

(d1) identifying in a rank order, sets of documents within said
database containing one or more documents which contain at least one
said likely...
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...SPECIFICATION it can be recognized for later conversion.) 

E. Perform all table checks, including a check against the 
above-mentioned table to see if the word is important (if not, the 
process ends) and, if it is, a check of the existing search word table 
to see if the search word already exists. 

F. If the search word is not in the table, add it. 

It will be apparent that such criteria can be changed to suit the
business practices and policies of the organization; a government
bureau will have quite different criteria from a manufacturing
company. The general approach, however, is likely to be quite the same in
that essential identifying material is extracted from each document
such that the document can be located and retrieved again, as needed,
with minimal recall of specific information. Furthermore, the essential
identifying information is extracted...
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Detailed Description 

... score is a measure of the degree to which the document relates to the
spatial location mentioned in its spatial information content. In cases
where the document has more than one...

...of spatial information content, the document is scored against each
instance.

The spatial-keyword document indexer examines each document in the
spatial document collection and represents it in a spatial-keyword
document index data structure. The spatial-keyword document indexer
indexes a document both by keywords and by at least one instance of
spatial information content. The spatial-keyword document index
enables unusually fast responses by the computer system to queries that
combine spatial criteria with keyword criteria.
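
Searcher's note: a minimal Python sketch of answering a query that
combines keyword and spatial criteria follows. It is an illustration
only. The excerpts refer to a spatial index tree, whereas this sketch
substitutes a plain keyword lookup table plus a linear bounding-box
test. The data layout, function names, and sample coordinates are
assumptions.

```python
def build_index(documents):
    """documents: {doc_id: (text, [(x, y), ...])}."""
    keyword_index, locations = {}, {}
    for doc_id, (text, points) in documents.items():
        locations[doc_id] = list(points)
        for word in set(text.lower().split()):
            keyword_index.setdefault(word, set()).add(doc_id)
    return keyword_index, locations


def query(keyword_index, locations, words, box):
    """Return ids matching every word with a point inside the box.

    box is (min_x, min_y, max_x, max_y).
    """
    min_x, min_y, max_x, max_y = box
    candidates = None
    for word in words:
        hits = keyword_index.get(word.lower(), set())
        candidates = hits if candidates is None else candidates & hits
    if candidates is None:          # no keyword criteria were given
        candidates = set(locations)
    return sorted(
        doc_id for doc_id in candidates
        if any(min_x <= x <= max_x and min_y <= y <= max_y
               for x, y in locations[doc_id])
    )


docs = {
    "d1": ("pizza in boston", [(42.36, -71.06)]),
    "d2": ("pizza in chicago", [(41.88, -87.63)]),
    "d3": ("museum in boston", [(42.34, -71.09)]),
}
kw_index, locs = build_index(docs)
print(query(kw_index, locs, ["pizza"], (42.0, -72.0, 43.0, -70.0)))
```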

The crawler extends the collection of known documents by examining the
hyperlinks contained in the known documents. When a hyperlink
references a previously unknown document, the crawler adds the unknown
document to the collection of known documents and examines them, in
turn, for new hyperlinks to follow.

The crawler may prioritize the... to index arbitrarily complex sets of
documents.
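
Searcher's note: the crawl loop described above can be sketched in
Python as follows. This is an illustration only. It assumes a
first-in, first-out visiting order, with no prioritization, and a
caller-supplied get_links function that returns the hyperlink targets of
a document. Both are assumptions, and the sample link graph is
hypothetical.

```python
from collections import deque


def crawl(seed_ids, get_links):
    """Visit every document reachable from the seeds exactly once."""
    known = set(seed_ids)       # the collection of known documents
    queue = deque(seed_ids)     # known documents not yet examined
    visited_order = []
    while queue:
        doc_id = queue.popleft()
        visited_order.append(doc_id)
        for target in get_links(doc_id):
            if target not in known:     # previously unknown document
                known.add(target)
                queue.append(target)    # examine it in turn
    return visited_order


# Hypothetical link graph standing in for real hyperlinks.
web = {"a": ["b", "c"], "b": ["a", "d"], "c": [], "d": ["a"]}
print(crawl(["a"], lambda doc_id: web.get(doc_id, [])))
```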



The search 50 process uses the spatial document index 503 and spatial-
keyword document index 505 to find documents that refer to a given
set of domain locations or regions, and documents related to a given
set of keywords existing in the word lexicon 225. The search 50
process can also find documents using a lexical tree 508, such as might
represent a filter. Thus, the search 50 process can respond to queries
that seek documents according to spatial domain criteria, keyword
criteria, filters, or any combination thereof. Furthermore, the search
50 process can invoke the document ranker 56 process to rank the
result set of documents by relevance to the query terms.
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Detailed Description 

... query. The contexts are obtained from a body of information such as a
collection of documents. The query may consist of fully specified terms
or partially and/or unspecified terms. The...

...the query can comprise contexts themselves or portions thereof, in
addition to or instead of document identifiers or locations. A
feature of the invention in this aspect is that rather than searching
documents containing query terms, the system searches contexts that
contain the terms.

In another aspect, the invention is a system for creating an index
identifying contexts for terms, the contexts occurring within
documents in a database. The system first selects a document in the
database and then selects a term within the document. The system
identifies contexts for the term within the document, the contexts
corresponding to strings within the document that contain the term.
The system further stores information identifying the document and
associating the document identifier with the context. Thus, a given
term may have multiple contexts within several documents. For a given
term, in addition to storing...

...preferably all, of the contexts for the term within a given document,
the system selects another term within the document and identifies
contexts for that term within the document. The context
identification process continues for the selected document until a set
of contexts has been identified for... from the 157th entry in array FT 52
it is known how many more documents following that first document may
be identified as containing the term Senator. The term JNHUAV is
similarly processed. Suppose that it is determined that the terms appear
in the... context identifier (doc20) for doc7 satisfying the Boolean
expression, match engine 34 would then compare the next document
identifier of term Senator, doc8, to doc7 of term [NHUM].

Steps 505 through 513 are repeated until either no more documents and 
contexts containing potential matches are found or until no... 
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Detailed Description 

... it is necessary to devise easily computable yet accurate means for
approximating relevance. A basic index format provides record
identifications ('document IDs') against particular index terms,
and when a word query is entered by a user this is simply matched
against the index terms to ... returned to the user.

If a user submits a search query comprising more than one word, such as
the boolean query ("b" AND "c" AND "d"), the search engine scans word
list 330 for each of the words "b," "c" and "d", then examines the
positional data associated with each character to determine whether they
occurred in the same document. If more than one instance is found where
all of the words in the search query occur on the same page, those
instances can be differentiated by looking at further criteria such as
the position of those words on each page. Once again, in accordance
with the methodology of the present invention, the closer the search
terms are to the top of a document identified, the higher the rank
(presumed pertinence) of that document. In addition, the number of
occurrences of those words on each page can be applied to filter the
results (the more occurrences, the higher the rank), as well as the
proximity of those words to one another on each page (the closer they
are, the higher the rank).
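
Searcher's note: the three ranking signals named above (closeness to
the top of the page, number of occurrences, and mutual proximity) can be
sketched in Python as follows. This is an illustration only. The excerpt
does not say how the signals are combined, so the lexicographic ordering
used here, the use of first occurrences to estimate proximity, and all
names are assumptions.

```python
def positions(page_words, term):
    """Word offsets at which the term occurs on the page."""
    return [i for i, word in enumerate(page_words) if word == term]


def rank_key(page_words, terms):
    """Sort key for a page (smaller ranks higher), or None if no match."""
    found = {term: positions(page_words, term) for term in terms}
    if any(not hits for hits in found.values()):
        return None                      # the boolean AND fails
    firsts = [hits[0] for hits in found.values()]
    top = min(firsts)                    # closer to the top is better
    count = sum(len(hits) for hits in found.values())
    spread = max(firsts) - min(firsts)   # closer together is better
    return (top, -count, spread)


def search(pages, terms):
    """Return ids of pages containing every term, best ranked first."""
    keyed = []
    for page_id, text in pages.items():
        key = rank_key(text.lower().split(), terms)
        if key is not None:
            keyed.append((key, page_id))
    return [page_id for _, page_id in sorted(keyed)]


pages = {"p1": "x x b c d", "p2": "b c d", "p3": "b c"}
print(search(pages, ["b", "c", "d"]))
```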



Figure 4 is a diagram showing in detail a preferred way in which basic
positional data is recorded for each unique indexable item on a given
page of stored information.
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Detailed Description 

score is a measure of the degree to which the document relates to the
spatial location mentioned in its spatial information content. In cases
where the document has more than one...

...of spatial information content, the document is scored against each
instance.

The spatial-keyword document indexer examines each document in the
spatial document collection and represents it in a spatial-keyword
document index data structure.

The spatial-keyword document indexer indexes a document both by
keywords and by at least one instance of spatial information content. The
spatial-keyword document index enables unusually fast responses by
the computer system to queries that combine spatial criteria with
keyword criteria.

The crawler extends the collection of known documents by examining the
hyperlinks contained in the known documents. When a hyperlink
references a previously unknown document, the crawler adds the unknown
document to the collection of known documents and examines them, in
turn, for new hyperlinks to follow.

The crawler may prioritize the... icon class has first visual
characteristics and the icon of the second icon class has second
visual characteristics that are different from the visual
characteristics associated with the first icon class. At least some of
the records identified by the plurality of record identifiers are of
a first type and at least some performed with a spatial-keyword
document index.

In general, in still another aspect, the invention is a method of
retrieving a plurality of record identifiers each...

...layer identifier as specified by search criteria, wherein such
retrieving is performed with a spatial-keyword document index.

Preferred embodiments include one or more of the following features.
The spatial-keyword document index includes a spatial index tree extended
to reference documents and... of tree addresses of leaves in a trimmed
result tree. For example, given a candidate word or phrase, the SMI
438 queries the spatial-keyword document index 505 to get the
trimmed result tree for this word or phrase and performs the following
operation on this list of addresses.

From the tree, the SMI 438 creates a list... to index arbitrarily complex
sets of documents.

The search 50 process uses the spatial document index 503 and spatial-
keyword document index 505 to find documents that refer to a given set
of domain locations or regions, and documents related to a given set of
keywords existing in the word lexicon 225.

The search 50 process can also find documents using a lexical tree 508,
such as might represent a filter. Thus, the search 50 process can respond
to queries that seek documents according to spatial domain criteria,
keyword criteria, filters, or any combination thereof.

Furthermore, the search 50 process can invoke the document ranker 56
process to rank the result set of documents by relevance to the query
terms.

The search 50 process answers queries via the procedure in Fig. 6. A
query includes at least one of the following: a bounding region
specifying a closed shape (typically a polygon in two dimensions), words,
phrases, and layers. The bounding region can be the domain frame from
the map interface 80...
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Method of automatic classification and archiving of a document 
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NOVELTY - The automatic identification uses a computer to analyze
data contained in the document according to its content and/or
positioning in the document, and compares these with set identification
criteria for the document based on content and/or position. Where a
match is not obtained a new set of measures and criteria are invoked.

USE - Automatic classification and archiving of document such as
client record.

ADVANTAGE - Automates inspection of document to extract
classification data from the document.

DESCRIPTION OF DRAWING(S) - The drawing shows a block diagram of
the procedure.
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The distribution and cable storage box (1) is fitted to the rear or 
bottom of a computer workstation. The box has top and bottom plates 
(2,3) connected to bands (5) that form the sides of the box. The box 
houses power distribution outlets to connect power cables for the 
computer and all its ancillary devices. There is space inside the box 
to accept excess lengths of power cable. 

The box is parallelepipedic in shape and has the approximate 
dimensions of a desk-top computer case. The bands that form the sides 
of the case have holes (7) to allow passage of cables and to give 
access to electrical power outlets. Unused holes are closed by metal 
strips (6). The bands forming the sides can slide round the periphery
of the box. Ancillary devices that are supplied with a transformer are 
connected to outlets at the top of the box, and a protective cover (9) 
is placed over these transformers. 

ADVANTAGE - Allows computer peripherals, such as printers, screen, 
scanners, facsimile machines and communication units, to be activated 
with single switch, and collects spare length of cable to prevent 
tangling and to give better appearance. 
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Abstract: A dynamic analytical file organization is presented which
enables the file designer to estimate file performance and cost
against time and offers a quantitative solution to file organization
problems. Costs, measured in terms of processing time, reorganization and
storage costs, are obtained from the input data characteristics, user
requirements and hardware specifications. Highly elusive costs such as
those for software maintenance or system storage are not included. Six
common file organizations are featured and the usefulness of the
model as an operational tool is exhibited by means of a representative
series of demonstrations with different file sizes and types. 17 refs.
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ABSTRACT: This paper introduces a new document list viewer and browser,
which are convenient tools that apply natural language processing to
support document utilization. The document utilization process
involves a cycle in which required documents are first retrieved,
then analyzed by content. The user can then formulate ideas and
create another document. Both tools help the user perform essential
steps in this cycle. The document list viewer provides a
well-organized list of documents by clustering documents based on
content similarities and by attaching precise summaries generated
automatically, so the user can quickly find a required document even
from a long list of retrieved documents. With the automatic text
summarization function, the document browser allows users to skim
through a long document on a computer display, then interactively
creates customized summaries. This paper describes the features of
these new tools and discusses the requirements for document selection
and browsing. (author abst.)
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